Applying Statistical Models for The Sample Data Production Process of Data Warehouse

نویسندگان

  • Thanh N. HUYNH
  • Thanh N. Huynh
چکیده

For data warehouse and OLAP (On-Line Analytical Processing) systems various preparation tasks are necessary to qualify and improve the effectiveness of their usage. In order to put into operation such a task the creation of statistical sound sample data is indispensable. Applying statistical models for the generation of sample data promises a high-quality prospect which allows the production of sample data based on different usercontrollable statistical parameters. The sample data, which follows the defined statistical requirements, can be conveniently used for testing, benchmarking, demonstrating and training in the fields of data warehouse. In this paper, we address the activities of generation sample data; classify the generating methods and using BEDAWA [3] tool as illustration purposing to generate sample data. Furthermore, we discuss the uses of sample data in the field of data warehouse.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Improvement of the Analytical Queries Response Time in Real-Time Data Warehouse using Materialized Views Concatenation

A real-time data warehouse is a collection of recent and hierarchical data that is used for managers’ decision-making by creating online analytical queries. The volume of data collected from data sources and entered into the real-time data warehouse is constantly increasing. Moreover, as the volume of input data to the real time data warehouse increases, the interference between online loading ...

متن کامل

ارائه مدل تلفیقی برای ارزیابی آمادگی سازمان ها جهت پیاده سازی سیستم انباره داده با استفاده ازتحلیل سلسله مراتبی

Enterprise Data Warehouse initiative is a high investment project. The adoption of Data Warehouse will be significantly different depending upon the level of readiness of an organization. Before implementation of Data Warehouse system in a firm, it is necessary to evaluate the level of the readiness of firm. A successful Data Warehouse assessment model requires a deep understanding of opportuni...

متن کامل

Modeling and Simulation of Polyhydroxybutyrate Production by Protomonas extorquens in Fed-batch Culture

Modeling and simulation of Polyhydroxybutyrate (PHB) production by Protomonas extorquens in fed-batch culture were conducted in this research. The fed-batch model, developed for this process, employed a kinetic model proposed by other researchers. Several kinetic models were investigated to choose the best model. The criterion for this selection was goodness of fit (δ2). Haldane kinetic model w...

متن کامل

Model Selection for Mixture Models Using Perfect Sample

We have considered a perfect sample method for model selection of finite mixture models with either known (fixed) or unknown number of components which can be applied in the most general setting with assumptions on the relation between the rival models and the true distribution. It is, both, one or neither to be well-specified or mis-specified, they may be nested or non-nested. We consider mixt...

متن کامل

Modeling Ghotour-Chai River’s Rainfall-Runoff process by Genetic Programming

Considering the importance of water and computing the amount of rainfall runoff resulted from precipitation in recent decades, using appropriate methods for predicting the amount of runoff from rainfall date has been really essential. Rainfall-runoff models are used to estimate runoff generated from precipitation in the catchment area. Rainfall-runoff process is totally a non-linear phenomenon....

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2001